Overview
Dataset statistics
| Number of variables | 31 |
|---|---|
| Number of observations | 100000 |
| Missing cells | 200 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 65.8 MiB |
| Average record size in memory | 690.1 B |
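The average record size follows from the total memory footprint divided by the number of observations. A minimal sketch of that arithmetic, assuming the report's MiB means 1024² bytes (the helper name `average_record_size` is illustrative, not part of the profiling tool):

```python
MIB = 1024 ** 2  # bytes per mebibyte (assumed unit of "MiB" in the report)

def average_record_size(total_mib: float, n_rows: int) -> float:
    """Return the mean in-memory size of one row, in bytes."""
    return total_mib * MIB / n_rows

size = average_record_size(65.8, 100_000)
print(f"{size:.1f} B")
```

The result lands within a fraction of a byte of the reported 690.1 B; the small gap is expected because 65.8 MiB is itself a rounded figure.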
Variable types
| Text | 2 |
|---|---|
| Categorical | 13 |
| Numeric | 14 |
| DateTime | 1 |
| Unsupported | 1 |
Alerts
| Alert | Type |
|---|---|
| avg_monthly_balance is highly overall correlated with pca_1 and 1 other field | High correlation |
| cluster is highly overall correlated with cluster_label | High correlation |
| cluster_label is highly overall correlated with cluster | High correlation |
| credit_to_income is highly overall correlated with income and 5 other fields | High correlation |
| income is highly overall correlated with credit_to_income and 4 other fields | High correlation |
| income_credit_interaction is highly overall correlated with credit_to_income and 4 other fields | High correlation |
| income_log is highly overall correlated with credit_to_income and 4 other fields | High correlation |
| loan_amount is highly overall correlated with credit_to_income and 2 other fields | High correlation |
| loan_approved is highly overall correlated with potential_data_leakage | High correlation |
| loan_log is highly overall correlated with credit_to_income and 2 other fields | High correlation |
| pca_1 is highly overall correlated with avg_monthly_balance and 3 other fields | High correlation |
| pca_2 is highly overall correlated with credit_to_income and 5 other fields | High correlation |
| potential_data_leakage is highly overall correlated with loan_approved | High correlation |
| transaction_amount is highly overall correlated with transaction_type | High correlation |
| transaction_type is highly overall correlated with transaction_amount | High correlation |
| txn_intensity is highly overall correlated with avg_monthly_balance | High correlation |
| credit_to_income is highly skewed (γ₁ = 49.72325454) | Skewed |
| txn_intensity is highly skewed (γ₁ = -25.32234104) | Skewed |
| month is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| loan_amount has 3000 (3.0%) zeros | Zeros |
| loan_log has 3000 (3.0%) zeros | Zeros |
| credit_to_income has 3000 (3.0%) zeros | Zeros |
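The skewness alerts above report a coefficient γ1 for credit_to_income and txn_intensity. As a rough illustration of what such a coefficient measures, the sketch below computes the population (moment-based) skewness in plain Python. The report may well use a bias-corrected estimator, so this is not expected to reproduce its figures exactly; the function name and sample data are hypothetical.

```python
def skewness(values):
    """Population Fisher-Pearson skewness: g1 = m3 / m2**1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

symmetric = [1, 2, 3, 4, 5]          # no tail on either side
right_tailed = [1, 1, 1, 1, 100]     # one large outlier pulls the tail right

print(skewness(symmetric))
print(skewness(right_tailed))
```

A symmetric sample yields a value near zero, while a single large outlier produces a clearly positive value. Magnitudes like the ones flagged in the alerts usually point to a handful of extreme values.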
Reproduction
| Analysis started | 2025-04-14 16:48:51.608993 |
|---|---|
| Analysis finished | 2025-04-14 16:49:12.589397 |
| Duration | 20.98 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
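The reported duration can be cross-checked from the two timestamps in the table above. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-04-14 16:48:51.608993")
finished = datetime.fromisoformat("2025-04-14 16:49:12.589397")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference rounds to the 20.98 seconds shown in the table.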
Variables
customer_id
Text
| Distinct | 2500 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
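The report distinguishes Distinct (the number of different values) from Unique (the number of values that occur exactly once). With 2500 customer IDs that each appear 40 times, Distinct is 2500 while Unique is 0. A small sketch of that distinction, using a hypothetical helper and a three-ID toy sample:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy sample: three IDs, each repeated 40 times as in the report.
ids = ["CUST00001"] * 40 + ["CUST00020"] * 40 + ["CUST00039"] * 40
distinct, unique = distinct_and_unique(ids)
print(distinct, unique)
```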
Sample
| 1st row | CUST00001 |
|---|---|
| 2nd row | CUST00001 |
| 3rd row | CUST00001 |
| 4th row | CUST00001 |
| 5th row | CUST00001 |
Words
| Value | Count | Frequency (%) |
| cust00001 | 40 | < 0.1% |
| cust00020 | 40 | < 0.1% |
| cust00039 | 40 | < 0.1% |
| cust00008 | 40 | < 0.1% |
| cust00010 | 40 | < 0.1% |
| cust00012 | 40 | < 0.1% |
| cust00014 | 40 | < 0.1% |
| cust00017 | 40 | < 0.1% |
| cust00019 | 40 | < 0.1% |
| cust00023 | 40 | < 0.1% |
| Other values (2490) | 99600 | 99.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 141600 | 15.7% |
| C | 100000 | 11.1% |
| U | 100000 | 11.1% |
| S | 100000 | 11.1% |
| T | 100000 | 11.1% |
| 1 | 41560 | 4.6% |
| 4 | 41440 | 4.6% |
| 3 | 40640 | 4.5% |
| 8 | 40640 | 4.5% |
| 7 | 40560 | 4.5% |
| Other values (4) | 153560 | 17.1% |
account_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0036 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Checking |
|---|---|
| 2nd row | Checking |
| 3rd row | Checking |
| 4th row | Checking |
| 5th row | Checking |
Common Values
| Value | Count | Frequency (%) |
| Checking | 33960 | 34.0% |
| Salary | 33600 | 33.6% |
| Savings | 32440 | 32.4% |
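The Frequency (%) column is each count divided by the number of observations (100000 here). The sketch below shows one way to derive such labels. The `format_frequency` helper and its rounding rule are assumptions modelled on labels such as "< 0.1%" seen elsewhere in this report, not the profiling tool's own code.

```python
def format_frequency(count: int, total: int) -> str:
    """Format count/total as a percentage label with one decimal place."""
    pct = 100 * count / total
    # Assumption: non-zero shares that round to 0.0 are shown as "< 0.1%",
    # mirroring labels seen elsewhere in this report.
    if pct > 0 and round(pct, 1) == 0:
        return "< 0.1%"
    return f"{pct:.1f}%"

total_rows = 100_000
for value, count in [("Checking", 33960), ("Salary", 33600), ("Savings", 32440)]:
    print(value, format_frequency(count, total_rows))
```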
Words
| Value | Count | Frequency (%) |
| checking | 33960 | 34.0% |
| salary | 33600 | 33.6% |
| savings | 32440 | 32.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 99640 | 14.2% |
| i | 66400 | 9.5% |
| n | 66400 | 9.5% |
| g | 66400 | 9.5% |
| S | 66040 | 9.4% |
| C | 33960 | 4.8% |
| h | 33960 | 4.8% |
| e | 33960 | 4.8% |
| c | 33960 | 4.8% |
| k | 33960 | 4.8% |
| Other values (5) | 165680 | 23.7% |
income
Real number (ℝ)
High correlation 
| Distinct | 2375 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49958.179 |
| Minimum | -10969.05 |
| Maximum | 106662.35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 200 |
| Negative (%) | 0.2% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -10969.05 |
|---|---|
| 5-th percentile | 25439.431 |
| Q1 | 40262.615 |
| median | 50074.445 |
| Q3 | 60051.6 |
| 95-th percentile | 74269.95 |
| Maximum | 106662.35 |
| Range | 117631.4 |
| Interquartile range (IQR) | 19788.985 |
Descriptive statistics
| Standard deviation | 14973.502 |
|---|---|
| Coefficient of variation (CV) | 0.29972073 |
| Kurtosis | 0.35824293 |
| Mean | 49958.179 |
| Median Absolute Deviation (MAD) | 9900.69 |
| Skewness | -0.062855809 |
| Sum | 4.9958179 × 10⁹ |
| Variance | 2.2420576 × 10⁸ |
| Monotonicity | Not monotonic |
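Several of the income statistics above are simple functions of one another, which makes them easy to sanity-check. The sketch below recomputes the coefficient of variation, variance, sum, range and interquartile range from the rounded values printed in the report, so agreement is only expected up to rounding.

```python
from math import isclose

# Rounded values copied from the income tables above.
n = 100_000
mean = 49958.179
std = 14973.502
minimum, maximum = -10969.05, 106662.35
q1, q3 = 40262.615, 60051.6

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum of all values
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

print(f"CV={cv:.8f} variance={variance:.7e} sum={total:.7e}")
print(f"range={value_range:.1f} IQR={iqr:.3f}")
```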
Common Values
| Value | Count | Frequency (%) |
| 50074.445 | 5040 | 5.0% |
| 38985.86 | 40 | < 0.1% |
| 30401.1 | 40 | < 0.1% |
| 76327.72 | 40 | < 0.1% |
| 63600.74 | 40 | < 0.1% |
| 66584.91 | 40 | < 0.1% |
| 47304.1 | 40 | < 0.1% |
| 46787.36 | 40 | < 0.1% |
| 49572.1 | 40 | < 0.1% |
| 60738.07 | 40 | < 0.1% |
| Other values (2365) | 94600 | 94.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -10969.05 | 40 | < 0.1% |
| -10286.82 | 40 | < 0.1% |
| -4293.06 | 40 | < 0.1% |
| -1813.82 | 40 | < 0.1% |
| -808.52 | 40 | < 0.1% |
| 38.21 | 40 | < 0.1% |
| 4735.3 | 40 | < 0.1% |
| 9790.83 | 40 | < 0.1% |
| 10187.56 | 40 | < 0.1% |
| 10293.32 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 106662.35 | 40 | < 0.1% |
| 103346.87 | 40 | < 0.1% |
| 102705.14 | 40 | < 0.1% |
| 98447.44 | 40 | < 0.1% |
| 98239.21 | 40 | < 0.1% |
| 97233.45 | 40 | < 0.1% |
| 96698.78 | 40 | < 0.1% |
| 94039.62 | 40 | < 0.1% |
| 93755.49 | 40 | < 0.1% |
| 93420.26 | 40 | < 0.1% |
credit_score
Real number (ℝ)
| Distinct | 264 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 680.5408 |
| Minimum | 517 |
| Maximum | 855 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 517 |
|---|---|
| 5-th percentile | 604 |
| Q1 | 649 |
| median | 680 |
| Q3 | 711 |
| 95-th percentile | 761.05 |
| Maximum | 855 |
| Range | 338 |
| Interquartile range (IQR) | 62 |
Descriptive statistics
| Standard deviation | 47.493398 |
|---|---|
| Coefficient of variation (CV) | 0.069787731 |
| Kurtosis | 0.20725817 |
| Mean | 680.5408 |
| Median Absolute Deviation (MAD) | 31 |
| Skewness | 0.065335142 |
| Sum | 68054080 |
| Variance | 2255.6229 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 680 | 6200 | 6.2% |
| 656 | 1200 | 1.2% |
| 695 | 1080 | 1.1% |
| 705 | 1000 | 1.0% |
| 679 | 960 | 1.0% |
| 706 | 920 | 0.9% |
| 713 | 880 | 0.9% |
| 711 | 880 | 0.9% |
| 703 | 880 | 0.9% |
| 660 | 880 | 0.9% |
| Other values (254) | 85120 | 85.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 517 | 40 | < 0.1% |
| 525 | 40 | < 0.1% |
| 526 | 40 | < 0.1% |
| 532 | 40 | < 0.1% |
| 537 | 40 | < 0.1% |
| 539 | 40 | < 0.1% |
| 546 | 40 | < 0.1% |
| 550 | 40 | < 0.1% |
| 551 | 40 | < 0.1% |
| 552 | 80 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 855 | 40 | < 0.1% |
| 830 | 40 | < 0.1% |
| 828 | 40 | < 0.1% |
| 827 | 40 | < 0.1% |
| 826 | 40 | < 0.1% |
| 825 | 40 | < 0.1% |
| 824 | 40 | < 0.1% |
| 822 | 40 | < 0.1% |
| 820 | 40 | < 0.1% |
| 815 | 40 | < 0.1% |
employment_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.3748 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Self-employed |
|---|---|
| 2nd row | Self-employed |
| 3rd row | Self-employed |
| 4th row | Self-employed |
| 5th row | Self-employed |
Common Values
| Value | Count | Frequency (%) |
| Self-employed | 34600 | 34.6% |
| Employed | 33160 | 33.2% |
| Unemployed | 32240 | 32.2% |
Words
| Value | Count | Frequency (%) |
| self-employed | 34600 | 34.6% |
| employed | 33160 | 33.2% |
| unemployed | 32240 | 32.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 201440 | 19.4% |
| l | 134600 | 13.0% |
| m | 100000 | 9.6% |
| p | 100000 | 9.6% |
| o | 100000 | 9.6% |
| y | 100000 | 9.6% |
| d | 100000 | 9.6% |
| S | 34600 | 3.3% |
| f | 34600 | 3.3% |
| - | 34600 | 3.3% |
| Other values (3) | 97640 | 9.4% |
risk_segment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.3392 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Medium |
|---|---|
| 2nd row | Medium |
| 3rd row | Medium |
| 4th row | Medium |
| 5th row | Medium |
Common Values
| Value | Count | Frequency (%) |
| Medium | 33720 | 33.7% |
| Low | 33520 | 33.5% |
| High | 32760 | 32.8% |
Words
| Value | Count | Frequency (%) |
| medium | 33720 | 33.7% |
| low | 33520 | 33.5% |
| high | 32760 | 32.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 66480 | 15.3% |
| M | 33720 | 7.8% |
| e | 33720 | 7.8% |
| d | 33720 | 7.8% |
| u | 33720 | 7.8% |
| m | 33720 | 7.8% |
| L | 33520 | 7.7% |
| o | 33520 | 7.7% |
| w | 33520 | 7.7% |
| H | 32760 | 7.5% |
| Other values (2) | 65520 | 15.1% |
avg_monthly_balance
Real number (ℝ)
High correlation 
| Distinct | 2499 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20141.573 |
| Minimum | -15218.08 |
| Maximum | 53373.82 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2160 |
| Negative (%) | 2.2% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -15218.08 |
|---|---|
| 5-th percentile | 3979.89 |
| Q1 | 13211.865 |
| median | 19875.915 |
| Q3 | 26853.403 |
| 95-th percentile | 36973.173 |
| Maximum | 53373.82 |
| Range | 68591.9 |
| Interquartile range (IQR) | 13641.537 |
Descriptive statistics
| Standard deviation | 10018.577 |
|---|---|
| Coefficient of variation (CV) | 0.49740788 |
| Kurtosis | -0.056890561 |
| Mean | 20141.573 |
| Median Absolute Deviation (MAD) | 6829.245 |
| Skewness | 0.043922871 |
| Sum | 2.0141573 × 10⁹ |
| Variance | 1.0037189 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 23395.37 | 80 | 0.1% |
| 13080.97 | 40 | < 0.1% |
| 22834.35 | 40 | < 0.1% |
| 7103.43 | 40 | < 0.1% |
| 31213.73 | 40 | < 0.1% |
| 16594.96 | 40 | < 0.1% |
| 26636.95 | 40 | < 0.1% |
| 20859.61 | 40 | < 0.1% |
| 27828.78 | 40 | < 0.1% |
| 37585.65 | 40 | < 0.1% |
| Other values (2489) | 99560 | 99.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -15218.08 | 40 | < 0.1% |
| -11969.03 | 40 | < 0.1% |
| -9995.26 | 40 | < 0.1% |
| -9108.36 | 40 | < 0.1% |
| -8198 | 40 | < 0.1% |
| -7291.8 | 40 | < 0.1% |
| -6901.33 | 40 | < 0.1% |
| -6724.83 | 40 | < 0.1% |
| -6702.78 | 40 | < 0.1% |
| -6655.13 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 53373.82 | 40 | < 0.1% |
| 51850.94 | 40 | < 0.1% |
| 49394.28 | 40 | < 0.1% |
| 47802.02 | 40 | < 0.1% |
| 47691.14 | 40 | < 0.1% |
| 47577.26 | 40 | < 0.1% |
| 47260.78 | 40 | < 0.1% |
| 46963.45 | 40 | < 0.1% |
| 46853.93 | 40 | < 0.1% |
| 46672.96 | 40 | < 0.1% |
num_transactions
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.9356 |
| Minimum | 15 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 26 |
| median | 30 |
| Q3 | 34 |
| 95-th percentile | 39 |
| Maximum | 52 |
| Range | 37 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.3437569 |
|---|---|
| Coefficient of variation (CV) | 0.17850843 |
| Kurtosis | -0.025579477 |
| Mean | 29.9356 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.081991953 |
| Sum | 2993560 |
| Variance | 28.555738 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 28 | 7800 | 7.8% |
| 29 | 7400 | 7.4% |
| 30 | 7080 | 7.1% |
| 31 | 6960 | 7.0% |
| 27 | 6960 | 7.0% |
| 32 | 6760 | 6.8% |
| 33 | 6120 | 6.1% |
| 34 | 5600 | 5.6% |
| 26 | 5440 | 5.4% |
| 35 | 4880 | 4.9% |
| Other values (24) | 35000 | 35.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 15 | 120 | 0.1% |
| 16 | 240 | 0.2% |
| 17 | 400 | 0.4% |
| 18 | 920 | 0.9% |
| 19 | 880 | 0.9% |
| 20 | 1320 | 1.3% |
| 21 | 1560 | 1.6% |
| 22 | 2840 | 2.8% |
| 23 | 3280 | 3.3% |
| 24 | 4080 | 4.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 52 | 40 | < 0.1% |
| 49 | 40 | < 0.1% |
| 46 | 80 | 0.1% |
| 45 | 200 | 0.2% |
| 44 | 240 | 0.2% |
| 43 | 640 | 0.6% |
| 42 | 520 | 0.5% |
| 41 | 920 | 0.9% |
| 40 | 1200 | 1.2% |
| 39 | 1400 | 1.4% |
transaction_amount
Real number (ℝ)
High correlation 
| Distinct | 98406 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19673.189 |
| Minimum | 50.27 |
| Maximum | 99986.39 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 50.27 |
|---|---|
| 5-th percentile | 1079.987 |
| Q1 | 4767.6475 |
| median | 9338.06 |
| Q3 | 25246.022 |
| 95-th percentile | 75163.58 |
| Maximum | 99986.39 |
| Range | 99936.12 |
| Interquartile range (IQR) | 20478.375 |
Descriptive statistics
| Standard deviation | 22529.583 |
|---|---|
| Coefficient of variation (CV) | 1.1451923 |
| Kurtosis | 2.3807118 |
| Mean | 19673.189 |
| Median Absolute Deviation (MAD) | 6901.98 |
| Skewness | 1.7338031 |
| Sum | 1.9673189 × 10⁹ |
| Variance | 5.0758213 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 6410.59 | 4 | < 0.1% |
| 5903.81 | 3 | < 0.1% |
| 5422.61 | 3 | < 0.1% |
| 3632.84 | 3 | < 0.1% |
| 14496.86 | 3 | < 0.1% |
| 13085.33 | 3 | < 0.1% |
| 2255.91 | 3 | < 0.1% |
| 7469.16 | 3 | < 0.1% |
| 9975.38 | 3 | < 0.1% |
| 3709.57 | 3 | < 0.1% |
| Other values (98396) | 99969 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 50.27 | 1 | < 0.1% |
| 50.57 | 1 | < 0.1% |
| 50.78 | 1 | < 0.1% |
| 50.97 | 1 | < 0.1% |
| 50.98 | 1 | < 0.1% |
| 51.12 | 1 | < 0.1% |
| 51.29 | 1 | < 0.1% |
| 51.5 | 1 | < 0.1% |
| 52.21 | 1 | < 0.1% |
| 52.6 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 99986.39 | 1 | < 0.1% |
| 99981.98 | 1 | < 0.1% |
| 99979.81 | 1 | < 0.1% |
| 99977.77 | 1 | < 0.1% |
| 99975.67 | 1 | < 0.1% |
| 99962.27 | 1 | < 0.1% |
| 99956.55 | 1 | < 0.1% |
| 99939.32 | 1 | < 0.1% |
| 99933.66 | 1 | < 0.1% |
| 99924.21 | 1 | < 0.1% |
transaction_type
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 4.00068 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ATM |
|---|---|
| 2nd row | UPI |
| 3rd row | UPI |
| 4th row | ATM |
| 5th row | Debit |
Common Values
| Value | Count | Frequency (%) |
| UPI | 20056 | 20.1% |
| Debit | 20046 | 20.0% |
| Credit | 19992 | 20.0% |
| POS | 19963 | 20.0% |
| ATM | 19943 | 19.9% |
Words
| Value | Count | Frequency (%) |
| upi | 20056 | 20.1% |
| debit | 20046 | 20.0% |
| credit | 19992 | 20.0% |
| pos | 19963 | 20.0% |
| atm | 19943 | 19.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 40038 | 10.0% |
| i | 40038 | 10.0% |
| t | 40038 | 10.0% |
| P | 40019 | 10.0% |
| U | 20056 | 5.0% |
| I | 20056 | 5.0% |
| D | 20046 | 5.0% |
| b | 20046 | 5.0% |
| C | 19992 | 5.0% |
| r | 19992 | 5.0% |
| Other values (6) | 119747 | 29.9% |
transaction_date
Date
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| Minimum | 2025-01-02 00:00:00 |
| Maximum | 2025-12-04 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
txn_location
Text
| Distinct | 38007 |
|---|---|
| Distinct (%) | 38.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 12.04348 |
| Min length | 5 |
Unique
| Unique | 20929 |
|---|---|
| Unique (%) | 20.9% |
Sample
| 1st row | Moralesfort |
|---|---|
| 2nd row | Deborahburgh |
| 3rd row | South Adam |
| 4th row | Marymouth |
| 5th row | Lake Susanmouth |
Words
| Value | Count | Frequency (%) |
| new | 7186 | 4.8% |
| east | 7185 | 4.8% |
| south | 7167 | 4.8% |
| west | 7106 | 4.7% |
| port | 7099 | 4.7% |
| lake | 7062 | 4.7% |
| north | 7025 | 4.7% |
| michael | 558 | 0.4% |
| james | 406 | 0.3% |
| david | 391 | 0.3% |
| Other values (19474) | 98645 | 65.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 116024 | 9.6% |
| t | 95027 | 7.9% |
| r | 93834 | 7.8% |
| a | 93521 | 7.8% |
| o | 82297 | 6.8% |
| h | 67413 | 5.6% |
| n | 65537 | 5.4% |
| i | 59673 | 5.0% |
| s | 56398 | 4.7% |
| (space) | 49830 | 4.1% |
| Other values (43) | 424794 | 35.3% |
device_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.9 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 4.99515 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mobile |
|---|---|
| 2nd row | Web |
| 3rd row | Branch |
| 4th row | Branch |
| 5th row | Branch |
Common Values
| Value | Count | Frequency (%) |
| Web | 33495 | 33.5% |
| Branch | 33288 | 33.3% |
| Mobile | 33217 | 33.2% |
Words
| Value | Count | Frequency (%) |
| web | 33495 | 33.5% |
| branch | 33288 | 33.3% |
| mobile | 33217 | 33.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 66712 | 13.4% |
| b | 66712 | 13.4% |
| W | 33495 | 6.7% |
| B | 33288 | 6.7% |
| r | 33288 | 6.7% |
| a | 33288 | 6.7% |
| n | 33288 | 6.7% |
| c | 33288 | 6.7% |
| h | 33288 | 6.7% |
| M | 33217 | 6.6% |
| Other values (3) | 99651 | 19.9% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 50560 | 50.6% |
| 1 | 49440 | 49.4% |
Words
| Value | Count | Frequency (%) |
| 0 | 50560 | 50.6% |
| 1 | 49440 | 49.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 50560 | 50.6% |
| 1 | 49440 | 49.4% |
loan_approved
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 50600 | 50.6% |
| 1 | 49400 | 49.4% |
Words
| Value | Count | Frequency (%) |
| 0 | 50600 | 50.6% |
| 1 | 49400 | 49.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 50600 | 50.6% |
| 1 | 49400 | 49.4% |
loan_amount
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2364 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192899.86 |
| Minimum | 0 |
| Maximum | 493705.74 |
| Zeros | 3000 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 43534.696 |
| Q1 | 144999.65 |
| median | 194831.33 |
| Q3 | 248171.71 |
| 95-th percentile | 317601.55 |
| Maximum | 493705.74 |
| Range | 493705.74 |
| Interquartile range (IQR) | 103172.05 |
Descriptive statistics
| Standard deviation | 79278.609 |
|---|---|
| Coefficient of variation (CV) | 0.41098324 |
| Kurtosis | 0.16434065 |
| Mean | 192899.86 |
| Median Absolute Deviation (MAD) | 51583.17 |
| Skewness | -0.21495432 |
| Sum | 1.9289986 × 10¹⁰ |
| Variance | 6.2850979 × 10⁹ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3000 | 3.0% |
| 194831.325 | 2520 | 2.5% |
| 237115.06 | 40 | < 0.1% |
| 227446.93 | 40 | < 0.1% |
| 304148.93 | 40 | < 0.1% |
| 259514.88 | 40 | < 0.1% |
| 272033.41 | 40 | < 0.1% |
| 245999.27 | 40 | < 0.1% |
| 156738.55 | 40 | < 0.1% |
| 230617.62 | 40 | < 0.1% |
| Other values (2354) | 94160 | 94.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 3000 | 3.0% |
| 3362.64 | 40 | < 0.1% |
| 4238.66 | 40 | < 0.1% |
| 4324.61 | 40 | < 0.1% |
| 4366.11 | 40 | < 0.1% |
| 5376.96 | 40 | < 0.1% |
| 6105.23 | 40 | < 0.1% |
| 6460.15 | 40 | < 0.1% |
| 9081.01 | 40 | < 0.1% |
| 10010.47 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 493705.74 | 40 | < 0.1% |
| 478715.7 | 40 | < 0.1% |
| 452691.55 | 40 | < 0.1% |
| 450780.72 | 40 | < 0.1% |
| 426378.25 | 40 | < 0.1% |
| 424249.93 | 40 | < 0.1% |
| 417865.54 | 40 | < 0.1% |
| 403457.74 | 40 | < 0.1% |
| 402643.53 | 40 | < 0.1% |
| 400216.87 | 40 | < 0.1% |
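The alerts list the same zero count (3000) for loan_amount and loan_log. The report does not say how loan_log was derived. If it were computed as log1p(loan_amount), which is purely an assumption made here for illustration, every zero loan amount would map to a zero log value, and the matching counts would follow directly:

```python
from math import log1p

# Hypothetical sample; the first, second and last values appear in the tables above.
loan_amounts = [0.0, 3362.64, 194831.325, 0.0, 493705.74]

# Assumed transform (not confirmed by the report): log(1 + x).
loan_logs = [log1p(amount) for amount in loan_amounts]

zeros_in = sum(1 for amount in loan_amounts if amount == 0)
zeros_out = sum(1 for value in loan_logs if value == 0)
print(zeros_in, zeros_out)
```

Under that assumption the zeros in both columns describe the same rows, so the two alerts are one finding rather than two.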
loan_purpose
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.372 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Personal |
|---|---|
| 2nd row | Personal |
| 3rd row | Personal |
| 4th row | Personal |
| 5th row | Personal |
Common Values
| Value | Count | Frequency (%) |
| Business | 20680 | 20.7% |
| Car | 20440 | 20.4% |
| Home | 20080 | 20.1% |
| Education | 19720 | 19.7% |
| Personal | 19080 | 19.1% |
Words
| Value | Count | Frequency (%) |
| business | 20680 | 20.7% |
| car | 20440 | 20.4% |
| home | 20080 | 20.1% |
| education | 19720 | 19.7% |
| personal | 19080 | 19.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 81120 | 12.7% |
| e | 59840 | 9.4% |
| n | 59480 | 9.3% |
| a | 59240 | 9.3% |
| o | 58880 | 9.2% |
| i | 40400 | 6.3% |
| u | 40400 | 6.3% |
| r | 39520 | 6.2% |
| B | 20680 | 3.2% |
| C | 20440 | 3.2% |
| Other values (8) | 157200 | 24.7% |
credit_utilization
Real number (ℝ)
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.490656 |
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 480 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.24 |
| median | 0.49 |
| Q3 | 0.73 |
| 95-th percentile | 0.94 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.49 |
Descriptive statistics
| Standard deviation | 0.28520348 |
|---|---|
| Coefficient of variation (CV) | 0.58126972 |
| Kurtosis | -1.162722 |
| Mean | 0.490656 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 0.042016701 |
| Sum | 49065.6 |
| Variance | 0.081341023 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.21 | 1480 | 1.5% |
| 0.49 | 1440 | 1.4% |
| 0.61 | 1360 | 1.4% |
| 0.1 | 1320 | 1.3% |
| 0.57 | 1320 | 1.3% |
| 0.47 | 1280 | 1.3% |
| 0.26 | 1280 | 1.3% |
| 0.54 | 1280 | 1.3% |
| 0.85 | 1280 | 1.3% |
| 0.6 | 1280 | 1.3% |
| Other values (91) | 86680 | 86.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 480 | 0.5% |
| 0.01 | 1040 | 1.0% |
| 0.02 | 880 | 0.9% |
| 0.03 | 1080 | 1.1% |
| 0.04 | 960 | 1.0% |
| 0.05 | 1240 | 1.2% |
| 0.06 | 1200 | 1.2% |
| 0.07 | 800 | 0.8% |
| 0.08 | 1000 | 1.0% |
| 0.09 | 840 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 400 | 0.4% |
| 0.99 | 680 | 0.7% |
| 0.98 | 1040 | 1.0% |
| 0.97 | 880 | 0.9% |
| 0.96 | 800 | 0.8% |
| 0.95 | 880 | 0.9% |
| 0.94 | 960 | 1.0% |
| 0.93 | 1120 | 1.1% |
| 0.92 | 1160 | 1.2% |
| 0.91 | 1080 | 1.1% |
default_history
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 50760 | 50.8% |
| 0 | 49240 | 49.2% |
branch_rating
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 21120 | 21.1% |
| 5 | 20720 | 20.7% |
| 2 | 20640 | 20.6% |
| 3 | 18880 | 18.9% |
| 1 | 18640 | 18.6% |
potential_data_leakage
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 52160 | 52.2% |
| 1 | 47840 | 47.8% |
pca_1
Real number (ℝ)
High correlation 
| Distinct | 2500 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.1368684 × 10⁻¹⁸ |
| Minimum | -3.6178642 |
| Maximum | 3.546399 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 49480 |
| Negative (%) | 49.5% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -3.6178642 |
|---|---|
| 5-th percentile | -1.6965701 |
| Q1 | -0.69716289 |
| median | 0.0083633478 |
| Q3 | 0.69458212 |
| 95-th percentile | 1.6739771 |
| Maximum | 3.546399 |
| Range | 7.1642631 |
| Interquartile range (IQR) | 1.391745 |
Descriptive statistics
| Standard deviation | 1.0258463 |
|---|---|
| Coefficient of variation (CV) | -9.0234395 × 10¹⁷ |
| Kurtosis | 0.1138397 |
| Mean | -1.1368684 × 10⁻¹⁸ |
| Median Absolute Deviation (MAD) | 0.69628493 |
| Skewness | 0.015404185 |
| Sum | 1.2505552 × 10⁻¹² |
| Variance | 1.0523606 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -0.2041517771 | 40 | < 0.1% |
| 0.02187843649 | 40 | < 0.1% |
| -0.6499245679 | 40 | < 0.1% |
| -0.5115197416 | 40 | < 0.1% |
| 2.062301976 | 40 | < 0.1% |
| 0.7071842468 | 40 | < 0.1% |
| 1.170570834 | 40 | < 0.1% |
| -0.1870268049 | 40 | < 0.1% |
| 1.424886461 | 40 | < 0.1% |
| 0.5010734353 | 40 | < 0.1% |
| Other values (2490) | 99600 | 99.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -3.617864168 | 40 | < 0.1% |
| -3.465897028 | 40 | < 0.1% |
| -3.318677398 | 40 | < 0.1% |
| -3.203900036 | 40 | < 0.1% |
| -2.95564782 | 40 | < 0.1% |
| -2.836249772 | 40 | < 0.1% |
| -2.811685225 | 40 | < 0.1% |
| -2.770784522 | 40 | < 0.1% |
| -2.727102914 | 40 | < 0.1% |
| -2.725494672 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.546398975 | 40 | < 0.1% |
| 3.432779605 | 40 | < 0.1% |
| 3.276167244 | 40 | < 0.1% |
| 3.246916211 | 40 | < 0.1% |
| 3.152939047 | 40 | < 0.1% |
| 3.105105046 | 40 | < 0.1% |
| 3.09636372 | 40 | < 0.1% |
| 2.947964777 | 40 | < 0.1% |
| 2.941294121 | 40 | < 0.1% |
| 2.901912694 | 40 | < 0.1% |
pca_2
Real number (ℝ)
High correlation 
| Distinct | 2500 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5011104 × 10⁻¹⁷ |
| Minimum | -3.3042718 |
| Maximum | 3.1381052 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 49080 |
| Negative (%) | 49.1% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -3.3042718 |
|---|---|
| 5-th percentile | -1.7711657 |
| Q1 | -0.66635621 |
| median | 0.033505931 |
| Q3 | 0.66684176 |
| 95-th percentile | 1.6164746 |
| Maximum | 3.1381052 |
| Range | 6.442377 |
| Interquartile range (IQR) | 1.333198 |
Descriptive statistics
| Standard deviation | 1.00887 |
|---|---|
| Coefficient of variation (CV) | 4.0336885 × 10¹⁶ |
| Kurtosis | 0.10297923 |
| Mean | 2.5011104 × 10⁻¹⁷ |
| Median Absolute Deviation (MAD) | 0.66102645 |
| Skewness | -0.12943649 |
| Sum | 1.2505552 × 10⁻¹² |
| Variance | 1.0178188 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.8267322674 | 40 | < 0.1% |
| 0.4821690638 | 40 | < 0.1% |
| 1.737255526 | 40 | < 0.1% |
| 0.4520600605 | 40 | < 0.1% |
| 0.06165941181 | 40 | < 0.1% |
| -0.9445612927 | 40 | < 0.1% |
| 0.9102230024 | 40 | < 0.1% |
| 0.04845390424 | 40 | < 0.1% |
| 0.9318174168 | 40 | < 0.1% |
| -0.3472561166 | 40 | < 0.1% |
| Other values (2490) | 99600 | 99.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -3.304271813 | 40 | < 0.1% |
| -3.198447758 | 40 | < 0.1% |
| -3.190631893 | 40 | < 0.1% |
| -3.080113133 | 40 | < 0.1% |
| -3.075595029 | 40 | < 0.1% |
| -2.94397706 | 40 | < 0.1% |
| -2.918619623 | 40 | < 0.1% |
| -2.909682574 | 40 | < 0.1% |
| -2.902263913 | 40 | < 0.1% |
| -2.881619913 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.138105178 | 40 | < 0.1% |
| 3.074965874 | 40 | < 0.1% |
| 2.983873978 | 40 | < 0.1% |
| 2.965653142 | 40 | < 0.1% |
| 2.950217337 | 40 | < 0.1% |
| 2.873848536 | 40 | < 0.1% |
| 2.870980718 | 40 | < 0.1% |
| 2.705246691 | 40 | < 0.1% |
| 2.655423754 | 40 | < 0.1% |
| 2.60107966 | 40 | < 0.1% |
cluster
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 27200 | 27.2% |
| 2 | 26800 | 26.8% |
| 3 | 24360 | 24.4% |
| 0 | 21640 | 21.6% |
income_log
Real number (ℝ)
High correlation 
| Distinct | 2370 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 200 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.7678 |
| Minimum | 3.6689318 |
| Maximum | 11.577433 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 3.6689318 |
|---|---|
| 5-th percentile | 10.150928 |
| Q1 | 10.604605 |
| median | 10.821286 |
| Q3 | 11.003189 |
| 95-th percentile | 11.215611 |
| Maximum | 11.577433 |
| Range | 7.9085011 |
| Interquartile range (IQR) | 0.3985832 |
Descriptive statistics
| Standard deviation | 0.36769558 |
|---|---|
| Coefficient of variation (CV) | 0.034147697 |
| Kurtosis | 56.897823 |
| Mean | 10.7678 |
| Median Absolute Deviation (MAD) | 0.19398681 |
| Skewness | -3.8171406 |
| Sum | 1074626.5 |
| Variance | 0.13520004 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.82128605 | 5040 | 5.0% |
| 10.57097995 | 40 | < 0.1% |
| 11.01434243 | 40 | < 0.1% |
| 11.10624827 | 40 | < 0.1% |
| 10.76437339 | 40 | < 0.1% |
| 10.75338973 | 40 | < 0.1% |
| 10.81120363 | 40 | < 0.1% |
| 10.32226696 | 40 | < 0.1% |
| 10.86965757 | 40 | < 0.1% |
| 11.06519263 | 40 | < 0.1% |
| Other values (2360) | 94400 | 94.4% |
| (Missing) | 200 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.668931816 | 40 | < 0.1% |
| 8.463011519 | 40 | < 0.1% |
| 9.189303643 | 40 | < 0.1% |
| 9.229020801 | 40 | < 0.1% |
| 9.239347566 | 40 | < 0.1% |
| 9.244125184 | 40 | < 0.1% |
| 9.267630499 | 40 | < 0.1% |
| 9.27787191 | 40 | < 0.1% |
| 9.329909367 | 40 | < 0.1% |
| 9.356579553 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 11.57743289 | 40 | < 0.1% |
| 11.54585596 | 40 | < 0.1% |
| 11.53962718 | 40 | < 0.1% |
| 11.49728824 | 40 | < 0.1% |
| 11.49517088 | 40 | < 0.1% |
| 11.48488035 | 40 | < 0.1% |
| 11.47936641 | 40 | < 0.1% |
| 11.4514821 | 40 | < 0.1% |
| 11.44845617 | 40 | < 0.1% |
| 11.44487422 | 40 | < 0.1% |
loan_log
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2364 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.745456 |
| Minimum | 0 |
| Maximum | 13.109697 |
| Zeros | 3000 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.68133 |
| Q1 | 11.884493 |
| median | 12.179895 |
| Q3 | 12.42188 |
| 95-th percentile | 12.668556 |
| Maximum | 13.109697 |
| Range | 13.109697 |
| Interquartile range (IQR) | 0.53738676 |
Descriptive statistics
| Standard deviation | 2.1218612 |
|---|---|
| Coefficient of variation (CV) | 0.18065379 |
| Kurtosis | 25.198248 |
| Mean | 11.745456 |
| Median Absolute Deviation (MAD) | 0.26015095 |
| Skewness | -5.0852944 |
| Sum | 1174545.6 |
| Variance | 4.5022948 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3000 | 3.0% |
| 12.1798946 | 2520 | 2.5% |
| 12.376305 | 40 | < 0.1% |
| 12.33467661 | 40 | < 0.1% |
| 12.62527605 | 40 | < 0.1% |
| 12.46657317 | 40 | < 0.1% |
| 12.51368384 | 40 | < 0.1% |
| 12.41308791 | 40 | < 0.1% |
| 11.96234079 | 40 | < 0.1% |
| 12.34852063 | 40 | < 0.1% |
| Other values (2354) | 94160 | 94.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3000 | 3.0% |
| 8.120779 | 40 | < 0.1% |
| 8.352238356 | 40 | < 0.1% |
| 8.37230845 | 40 | < 0.1% |
| 8.381856742 | 40 | < 0.1% |
| 8.590064399 | 40 | < 0.1% |
| 8.71706484 | 40 | < 0.1% |
| 8.7735626 | 40 | < 0.1% |
| 9.114050813 | 40 | < 0.1% |
| 9.211486715 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13.10969698 | 40 | < 0.1% |
| 13.07886426 | 40 | < 0.1% |
| 13.02296848 | 40 | < 0.1% |
| 13.01873851 | 40 | < 0.1% |
| 12.96308449 | 40 | < 0.1% |
| 12.95808038 | 40 | < 0.1% |
| 12.94291738 | 40 | < 0.1% |
| 12.90782951 | 40 | < 0.1% |
| 12.90580939 | 40 | < 0.1% |
| 12.89976435 | 40 | < 0.1% |
month
Unsupported
Rejected  Unsupported 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
income_credit_interaction
Real number (ℝ)
High correlation 
| Distinct | 2459 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34016077 |
| Minimum | -8555859 |
| Maximum | 72239462 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 200 |
| Negative (%) | 0.2% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -8555859 |
|---|---|
| 5-th percentile | 16788952 |
| Q1 | 27231788 |
| median | 34050623 |
| Q3 | 40945759 |
| 95-th percentile | 51241228 |
| Maximum | 72239462 |
| Range | 80795321 |
| Interquartile range (IQR) | 13713971 |
Descriptive statistics
| Standard deviation | 10534197 |
|---|---|
| Coefficient of variation (CV) | 0.30968289 |
| Kurtosis | 0.25726617 |
| Mean | 34016077 |
| Median Absolute Deviation (MAD) | 6848024.1 |
| Skewness | 0.01471746 |
| Sum | 3.4016077 × 10¹² |
| Variance | 1.1096931 × 10¹⁴ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 34050622.6 | 360 | 0.4% |
| 34401143.71 | 160 | 0.2% |
| 30745709.23 | 120 | 0.1% |
| 35402632.62 | 120 | 0.1% |
| 33049133.7 | 120 | 0.1% |
| 32848835.92 | 120 | 0.1% |
| 32598463.7 | 120 | 0.1% |
| 29744220.33 | 80 | 0.1% |
| 31446751.46 | 80 | 0.1% |
| 29794294.77 | 80 | 0.1% |
| Other values (2449) | 98640 | 98.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -8555859 | 40 | < 0.1% |
| -6346967.94 | 40 | < 0.1% |
| -2696041.68 | 40 | < 0.1% |
| -1209817.94 | 40 | < 0.1% |
| -546559.52 | 40 | < 0.1% |
| 26059.22 | 40 | < 0.1% |
| 3627239.8 | 40 | < 0.1% |
| 6464204.96 | 40 | < 0.1% |
| 6657764.4 | 40 | < 0.1% |
| 6723789.6 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 72239462.13 | 40 | < 0.1% |
| 70008084 | 40 | < 0.1% |
| 69839495.2 | 40 | < 0.1% |
| 69306997.76 | 40 | < 0.1% |
| 68903878.1 | 40 | < 0.1% |
| 68669207.79 | 40 | < 0.1% |
| 65253821.04 | 40 | < 0.1% |
| 64161891.69 | 40 | < 0.1% |
| 63861822.22 | 40 | < 0.1% |
| 63700325.4 | 40 | < 0.1% |
credit_to_income
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 2422 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9815238 |
| Minimum | -146.93555 |
| Maximum | 4421.2385 |
| Zeros | 3000 |
| Zeros (%) | 3.0% |
| Negative | 200 |
| Negative (%) | 0.2% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -146.93555 |
|---|---|
| 5-th percentile | 0.76716213 |
| Q1 | 2.7351999 |
| median | 3.9170454 |
| Q3 | 5.3352779 |
| 95-th percentile | 9.0055343 |
| Maximum | 4421.2385 |
| Range | 4568.174 |
| Interquartile range (IQR) | 2.600078 |
Descriptive statistics
| Standard deviation | 88.467631 |
|---|---|
| Coefficient of variation (CV) | 14.790149 |
| Kurtosis | 2478.8622 |
| Mean | 5.9815238 |
| Median Absolute Deviation (MAD) | 1.2778802 |
| Skewness | 49.723255 |
| Sum | 598152.38 |
| Variance | 7826.5217 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3000 | 3.0% |
| 3.890755739 | 200 | 0.2% |
| 4.139720874 | 40 | < 0.1% |
| 5.342704007 | 40 | < 0.1% |
| 7.481290108 | 40 | < 0.1% |
| 5.007467681 | 40 | < 0.1% |
| 4.937759824 | 40 | < 0.1% |
| 2.76275997 | 40 | < 0.1% |
| 6.069575793 | 40 | < 0.1% |
| 2.290108463 | 40 | < 0.1% |
| Other values (2412) | 96480 | 96.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -146.9355479 | 40 | < 0.1% |
| -120.8173668 | 40 | < 0.1% |
| -46.40389929 | 40 | < 0.1% |
| -17.87461673 | 40 | < 0.1% |
| -10.59075407 | 40 | < 0.1% |
| 0 | 3000 | 3.0% |
| 0.05693434884 | 40 | < 0.1% |
| 0.06587706707 | 40 | < 0.1% |
| 0.06967629903 | 40 | < 0.1% |
| 0.09076292571 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4421.23846 | 40 | < 0.1% |
| 59.49816312 | 40 | < 0.1% |
| 33.67693169 | 40 | < 0.1% |
| 31.2933496 | 40 | < 0.1% |
| 24.23378343 | 40 | < 0.1% |
| 23.76321732 | 40 | < 0.1% |
| 22.0952875 | 40 | < 0.1% |
| 21.82118979 | 40 | < 0.1% |
| 20.94962392 | 40 | < 0.1% |
| 20.60981454 | 40 | < 0.1% |
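The skewness (49.7) and kurtosis (2478.9) reported for credit_to_income suggest a handful of extreme values dominating the tails. As a rough illustration, the sketch below applies Tukey's 1.5 × IQR rule to the quartiles above and checks which of the listed extreme values fall outside the fences. The rule and the multiplier `k=1.5` are assumptions chosen for illustration; the report itself does not say how, or whether, outliers were treated.

```python
# Quartiles copied from the credit_to_income quantile statistics.
Q1, Q3 = 2.7351999, 5.3352779

# Distinct extreme values copied from the minimum/maximum tables.
LISTED_MIN = [-146.9355479, -120.8173668, -46.40389929,
              -17.87461673, -10.59075407, 0.0]
LISTED_MAX = [4421.23846, 59.49816312, 33.67693169, 31.2933496,
              24.23378343, 23.76321732, 22.0952875, 21.82118979,
              20.94962392, 20.60981454]


def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences; k=1.5 is the conventional multiplier."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


lower, upper = tukey_fences(Q1, Q3)
below = [v for v in LISTED_MIN if v < lower]
above = [v for v in LISTED_MAX if v > upper]

print(f"fences: [{lower:.4f}, {upper:.4f}]")
print(f"listed values below the lower fence: {len(below)}")
print(f"listed values above the upper fence: {len(above)}")
```

Under this rule every listed negative value and every listed maximum falls outside the fences, while the 3000 zeros stay inside.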
txn_intensity
Real number (ℝ)
High correlation  Skewed 
| Distinct | 2500 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0011140181 |
| Minimum | -5.9649123 |
| Maximum | 3.4414946 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2160 |
| Negative (%) | 2.2% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | -5.9649123 |
|---|---|
| 5-th percentile | 0.0006632706 |
| Q1 | 0.001049692 |
| median | 0.001466099 |
| Q3 | 0.0021517368 |
| 95-th percentile | 0.0052566921 |
| Maximum | 3.4414946 |
| Range | 9.4064069 |
| Interquartile range (IQR) | 0.0011020448 |
Descriptive statistics
| Standard deviation | 0.13934555 |
|---|---|
| Coefficient of variation (CV) | 125.08373 |
| Kurtosis | 1490.2082 |
| Mean | 0.0011140181 |
| Median Absolute Deviation (MAD) | 0.00048570727 |
| Skewness | -25.322341 |
| Sum | 111.40181 |
| Variance | 0.019417181 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.002446114767 | 40 | < 0.1% |
| 0.001357544334 | 40 | < 0.1% |
| 0.001462635157 | 40 | < 0.1% |
| 0.003659688392 | 40 | < 0.1% |
| 0.001153301662 | 40 | < 0.1% |
| 0.001807668854 | 40 | < 0.1% |
| 0.001388995775 | 40 | < 0.1% |
| 0.001486054339 | 40 | < 0.1% |
| 0.001185780125 | 40 | < 0.1% |
| 0.0009577868738 | 40 | < 0.1% |
| Other values (2490) | 99600 | 99.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -5.964912281 | 40 | < 0.1% |
| -0.4202972834 | 40 | < 0.1% |
| -0.3215051203 | 40 | < 0.1% |
| -0.1018564153 | 40 | < 0.1% |
| -0.07884761183 | 40 | < 0.1% |
| -0.07525235488 | 40 | < 0.1% |
| -0.07378258731 | 40 | < 0.1% |
| -0.0690157586 | 40 | < 0.1% |
| -0.04340939755 | 40 | < 0.1% |
| -0.03836071862 | 40 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.441494592 | 40 | < 0.1% |
| 0.7416825599 | 40 | < 0.1% |
| 0.3455889412 | 40 | < 0.1% |
| 0.193143409 | 40 | < 0.1% |
| 0.1467812959 | 40 | < 0.1% |
| 0.09756561883 | 40 | < 0.1% |
| 0.09644701636 | 40 | < 0.1% |
| 0.07740324594 | 40 | < 0.1% |
| 0.07547961002 | 40 | < 0.1% |
| 0.06813589587 | 40 | < 0.1% |
cluster_label
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 27200 | 27.2% |
| 2 | 26800 | 26.8% |
| 3 | 24360 | 24.4% |
| 0 | 21640 | 21.6% |
Interactions
Correlations
| | account_type | avg_monthly_balance | branch_rating | cluster | cluster_label | credit_score | credit_to_income | credit_utilization | default_history | device_type | employment_status | has_credit_card | income | income_credit_interaction | income_log | loan_amount | loan_approved | loan_log | loan_purpose | num_transactions | pca_1 | pca_2 | potential_data_leakage | risk_segment | transaction_amount | transaction_type | txn_intensity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| account_type | 1.000 | 0.051 | 0.047 | 0.044 | 0.044 | 0.032 | 0.029 | 0.069 | 0.022 | 0.002 | 0.024 | 0.009 | 0.050 | 0.063 | 0.043 | 0.044 | 0.047 | 0.034 | 0.042 | 0.059 | 0.042 | 0.053 | 0.059 | 0.040 | 0.005 | 0.000 | 0.037 |
| avg_monthly_balance | 0.051 | 1.000 | 0.059 | 0.382 | 0.382 | -0.007 | 0.005 | 0.026 | 0.097 | 0.005 | 0.050 | 0.078 | 0.035 | 0.032 | 0.035 | 0.034 | 0.068 | 0.034 | 0.057 | 0.001 | 0.624 | 0.038 | 0.059 | 0.046 | -0.002 | 0.000 | -0.805 |
| branch_rating | 0.047 | 0.059 | 1.000 | 0.041 | 0.041 | 0.058 | 0.039 | 0.058 | 0.033 | 0.005 | 0.040 | 0.036 | 0.053 | 0.046 | 0.039 | 0.054 | 0.013 | 0.046 | 0.044 | 0.048 | 0.051 | 0.068 | 0.024 | 0.060 | 0.003 | 0.000 | 0.038 |
| cluster | 0.044 | 0.382 | 0.041 | 1.000 | 1.000 | 0.382 | 0.035 | 0.068 | 0.019 | 0.005 | 0.027 | 0.015 | 0.296 | 0.266 | 0.244 | 0.418 | 0.071 | 0.391 | 0.030 | 0.064 | 0.403 | 0.449 | 0.084 | 0.028 | 0.003 | 0.003 | 0.045 |
| cluster_label | 0.044 | 0.382 | 0.041 | 1.000 | 1.000 | 0.382 | 0.035 | 0.068 | 0.019 | 0.005 | 0.027 | 0.015 | 0.296 | 0.266 | 0.244 | 0.418 | 0.071 | 0.391 | 0.030 | 0.064 | 0.403 | 0.449 | 0.084 | 0.028 | 0.003 | 0.003 | 0.045 |
| credit_score | 0.032 | -0.007 | 0.058 | 0.382 | 0.382 | 1.000 | 0.007 | -0.005 | 0.061 | 0.000 | 0.051 | 0.088 | 0.032 | 0.252 | 0.031 | 0.016 | 0.041 | 0.016 | 0.060 | 0.004 | 0.422 | 0.019 | 0.049 | 0.057 | -0.002 | 0.000 | 0.007 |
| credit_to_income | 0.029 | 0.005 | 0.039 | 0.035 | 0.035 | 0.007 | 1.000 | -0.023 | 0.020 | 0.000 | 0.028 | 0.019 | -0.566 | -0.545 | -0.575 | 0.768 | 0.019 | 0.768 | 0.039 | 0.006 | 0.020 | 0.975 | 0.020 | 0.028 | -0.005 | 0.000 | -0.010 |
| credit_utilization | 0.069 | 0.026 | 0.058 | 0.068 | 0.068 | -0.005 | -0.023 | 1.000 | 0.077 | 0.000 | 0.067 | 0.071 | 0.007 | 0.001 | 0.007 | -0.022 | 0.070 | -0.022 | 0.051 | -0.037 | 0.013 | -0.018 | 0.071 | 0.050 | -0.004 | 0.002 | -0.016 |
| default_history | 0.022 | 0.097 | 0.033 | 0.019 | 0.019 | 0.061 | 0.020 | 0.077 | 1.000 | 0.004 | 0.024 | 0.026 | 0.045 | 0.025 | 0.034 | 0.049 | 0.032 | 0.021 | 0.028 | 0.059 | 0.052 | 0.045 | 0.027 | 0.036 | 0.003 | 0.000 | 0.034 |
| device_type | 0.002 | 0.005 | 0.005 | 0.005 | 0.005 | 0.000 | 0.000 | 0.000 | 0.004 | 1.000 | 0.003 | 0.001 | 0.003 | 0.002 | 0.000 | 0.000 | 0.000 | 0.001 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.003 | 0.005 | 0.000 |
| employment_status | 0.024 | 0.050 | 0.040 | 0.027 | 0.027 | 0.051 | 0.028 | 0.067 | 0.024 | 0.003 | 1.000 | 0.013 | 0.072 | 0.073 | 0.051 | 0.053 | 0.027 | 0.036 | 0.034 | 0.056 | 0.041 | 0.061 | 0.033 | 0.028 | 0.000 | 0.000 | 0.037 |
| has_credit_card | 0.009 | 0.078 | 0.036 | 0.015 | 0.015 | 0.088 | 0.019 | 0.071 | 0.026 | 0.001 | 0.013 | 1.000 | 0.034 | 0.059 | 0.025 | 0.076 | 0.028 | 0.041 | 0.040 | 0.052 | 0.061 | 0.048 | 0.018 | 0.015 | 0.000 | 0.004 | 0.034 |
| income | 0.050 | 0.035 | 0.053 | 0.296 | 0.296 | 0.032 | -0.566 | 0.007 | 0.045 | 0.003 | 0.072 | 0.034 | 1.000 | 0.968 | 1.000 | -0.006 | 0.079 | -0.006 | 0.058 | -0.008 | 0.505 | -0.640 | 0.074 | 0.033 | 0.003 | 0.003 | -0.020 |
| income_credit_interaction | 0.063 | 0.032 | 0.046 | 0.266 | 0.266 | 0.252 | -0.545 | 0.001 | 0.025 | 0.002 | 0.073 | 0.059 | 0.968 | 1.000 | 0.968 | -0.002 | 0.057 | -0.002 | 0.055 | -0.007 | 0.590 | -0.613 | 0.057 | 0.046 | 0.002 | 0.000 | -0.016 |
| income_log | 0.043 | 0.035 | 0.039 | 0.244 | 0.244 | 0.031 | -0.575 | 0.007 | 0.034 | 0.000 | 0.051 | 0.025 | 1.000 | 0.968 | 1.000 | -0.007 | 0.045 | -0.007 | 0.044 | -0.007 | 0.502 | -0.638 | 0.045 | 0.037 | 0.002 | 0.000 | -0.020 |
| loan_amount | 0.044 | 0.034 | 0.054 | 0.418 | 0.418 | 0.016 | 0.768 | -0.022 | 0.049 | 0.000 | 0.053 | 0.076 | -0.006 | -0.002 | -0.007 | 1.000 | 0.126 | 1.000 | 0.050 | 0.001 | 0.392 | 0.734 | 0.167 | 0.073 | -0.006 | 0.002 | -0.024 |
| loan_approved | 0.047 | 0.068 | 0.013 | 0.071 | 0.071 | 0.041 | 0.019 | 0.070 | 0.032 | 0.000 | 0.027 | 0.028 | 0.079 | 0.057 | 0.045 | 0.126 | 1.000 | 0.152 | 0.038 | 0.049 | 0.060 | 0.072 | 0.946 | 0.004 | 0.010 | 0.007 | 0.034 |
| loan_log | 0.034 | 0.034 | 0.046 | 0.391 | 0.391 | 0.016 | 0.768 | -0.022 | 0.021 | 0.001 | 0.036 | 0.041 | -0.006 | -0.002 | -0.007 | 1.000 | 0.152 | 1.000 | 0.042 | 0.001 | 0.392 | 0.734 | 0.174 | 0.036 | -0.006 | 0.000 | -0.024 |
| loan_purpose | 0.042 | 0.057 | 0.044 | 0.030 | 0.030 | 0.060 | 0.039 | 0.051 | 0.028 | 0.002 | 0.034 | 0.040 | 0.058 | 0.055 | 0.044 | 0.050 | 0.038 | 0.042 | 1.000 | 0.067 | 0.056 | 0.053 | 0.026 | 0.033 | 0.003 | 0.000 | 0.038 |
| num_transactions | 0.059 | 0.001 | 0.048 | 0.064 | 0.064 | 0.004 | 0.006 | -0.037 | 0.059 | 0.000 | 0.056 | 0.052 | -0.008 | -0.007 | -0.007 | 0.001 | 0.049 | 0.001 | 0.067 | 1.000 | -0.004 | 0.008 | 0.048 | 0.049 | -0.005 | 0.000 | 0.326 |
| pca_1 | 0.042 | 0.624 | 0.051 | 0.403 | 0.403 | 0.422 | 0.020 | 0.013 | 0.052 | 0.000 | 0.041 | 0.061 | 0.505 | 0.590 | 0.502 | 0.392 | 0.060 | 0.392 | 0.056 | -0.004 | 1.000 | -0.005 | 0.061 | 0.045 | -0.003 | 0.004 | -0.478 |
| pca_2 | 0.053 | 0.038 | 0.068 | 0.449 | 0.449 | 0.019 | 0.975 | -0.018 | 0.045 | 0.000 | 0.061 | 0.048 | -0.640 | -0.613 | -0.638 | 0.734 | 0.072 | 0.734 | 0.053 | 0.008 | -0.005 | 1.000 | 0.116 | 0.057 | -0.006 | 0.005 | -0.033 |
| potential_data_leakage | 0.059 | 0.059 | 0.024 | 0.084 | 0.084 | 0.049 | 0.020 | 0.071 | 0.027 | 0.000 | 0.033 | 0.018 | 0.074 | 0.057 | 0.045 | 0.167 | 0.946 | 0.174 | 0.026 | 0.048 | 0.061 | 0.116 | 1.000 | 0.004 | 0.009 | 0.005 | 0.034 |
| risk_segment | 0.040 | 0.046 | 0.060 | 0.028 | 0.028 | 0.057 | 0.028 | 0.050 | 0.036 | 0.000 | 0.028 | 0.015 | 0.033 | 0.046 | 0.037 | 0.073 | 0.004 | 0.036 | 0.033 | 0.049 | 0.045 | 0.057 | 0.004 | 1.000 | 0.000 | 0.004 | 0.037 |
| transaction_amount | 0.005 | -0.002 | 0.003 | 0.003 | 0.003 | -0.002 | -0.005 | -0.004 | 0.003 | 0.003 | 0.000 | 0.000 | 0.003 | 0.002 | 0.002 | -0.006 | 0.010 | -0.006 | 0.003 | -0.005 | -0.003 | -0.006 | 0.009 | 0.000 | 1.000 | 0.535 | -0.002 |
| transaction_type | 0.000 | 0.000 | 0.000 | 0.003 | 0.003 | 0.000 | 0.000 | 0.002 | 0.000 | 0.005 | 0.000 | 0.004 | 0.003 | 0.000 | 0.000 | 0.002 | 0.007 | 0.000 | 0.000 | 0.000 | 0.004 | 0.005 | 0.005 | 0.004 | 0.535 | 1.000 | 0.002 |
| txn_intensity | 0.037 | -0.805 | 0.038 | 0.045 | 0.045 | 0.007 | -0.010 | -0.016 | 0.034 | 0.000 | 0.037 | 0.034 | -0.020 | -0.016 | -0.020 | -0.024 | 0.034 | -0.024 | 0.038 | 0.326 | -0.478 | -0.033 | 0.034 | 0.037 | -0.002 | 0.002 | 1.000 |
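The "High correlation" alerts in the overview appear consistent with flagging any pair whose absolute coefficient in this matrix is at least 0.5: for example, cluster and pca_2 (0.449) are not flagged, while transaction_amount and transaction_type (0.535) are. The sketch below applies that cut-off to a few coefficients copied from the matrix. The 0.5 threshold is an assumption inferred from the alerts, not a setting stated in the report, and `high_correlation_pairs` is a hypothetical helper name.

```python
# A handful of coefficients copied from the correlation matrix above.
CORR = {
    ("avg_monthly_balance", "txn_intensity"): -0.805,
    ("avg_monthly_balance", "pca_1"): 0.624,
    ("cluster", "cluster_label"): 1.000,
    ("cluster", "pca_2"): 0.449,
    ("credit_to_income", "pca_2"): 0.975,
    ("income", "income_log"): 1.000,
    ("loan_approved", "potential_data_leakage"): 0.946,
    ("transaction_amount", "transaction_type"): 0.535,
    ("num_transactions", "txn_intensity"): 0.326,
}

THRESHOLD = 0.5  # assumed cut-off, inferred from which pairs the report flags


def high_correlation_pairs(corr, threshold=THRESHOLD):
    """Return (a, b, r) tuples with |r| >= threshold, strongest first."""
    flagged = [(a, b, r) for (a, b), r in corr.items() if abs(r) >= threshold]
    return sorted(flagged, key=lambda item: abs(item[2]), reverse=True)


pairs = high_correlation_pairs(CORR)
for a, b, r in pairs:
    print(f"{a} ~ {b}: {r:+.3f}")
```

The 0.946 coefficient between loan_approved and potential_data_leakage is the pair most worth checking before modelling, since a near-duplicate of the target would inflate any validation score.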
Missing values
Sample
First rows
| | customer_id | account_type | income | credit_score | employment_status | risk_segment | avg_monthly_balance | num_transactions | transaction_amount | transaction_type | transaction_date | txn_location | device_type | has_credit_card | loan_approved | loan_amount | loan_purpose | credit_utilization | default_history | branch_rating | potential_data_leakage | pca_1 | pca_2 | cluster | income_log | loan_log | month | income_credit_interaction | credit_to_income | txn_intensity | cluster_label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 4049.11 | ATM | 2025-01-14 | Moralesfort | Mobile | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 1 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 3626.64 | UPI | 2025-01-17 | Deborahburgh | Web | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 2 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 7761.47 | UPI | 2025-01-18 | South Adam | Branch | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 3 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 6930.75 | ATM | 2025-01-19 | Marymouth | Branch | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 4 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 19373.61 | Debit | 2025-01-20 | Lake Susanmouth | Branch | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 5 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 7150.20 | UPI | 2025-01-20 | Kellyborough | Mobile | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 6 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 1995.91 | ATM | 2025-01-20 | South Rogerton | Mobile | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 7 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 8947.42 | POS | 2025-01-20 | West Roymouth | Web | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 8 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 705.97 | UPI | 2025-01-22 | Martinstad | Branch | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |
| 9 | CUST00001 | Checking | 38985.86 | 727.0 | Self-employed | Medium | 13080.97 | 32 | 66056.37 | Credit | 2025-01-25 | Lake Nicholaschester | Web | 1 | 1 | 230617.62 | Personal | 0.92 | 1 | 2 | 0 | -0.204152 | 0.826732 | 2 | 10.57098 | 12.348521 | 2025-01 | 28342720.22 | 5.915265 | 0.002446 | 2 |

Last rows

| | customer_id | account_type | income | credit_score | employment_status | risk_segment | avg_monthly_balance | num_transactions | transaction_amount | transaction_type | transaction_date | txn_location | device_type | has_credit_card | loan_approved | loan_amount | loan_purpose | credit_utilization | default_history | branch_rating | potential_data_leakage | pca_1 | pca_2 | cluster | income_log | loan_log | month | income_credit_interaction | credit_to_income | txn_intensity | cluster_label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99990 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 11081.77 | ATM | 2025-03-21 | Stewartfurt | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99991 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 2508.48 | POS | 2025-03-22 | North Christopher | Web | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99992 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 8548.16 | UPI | 2025-03-23 | South Thomasburgh | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99993 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 53260.52 | Credit | 2025-03-24 | Jonesside | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99994 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 7456.73 | UPI | 2025-03-27 | Port Tracyport | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99995 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 9016.51 | ATM | 2025-03-29 | Madisonburgh | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99996 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 9217.11 | POS | 2025-03-04 | Danachester | Mobile | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-03 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99997 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 13560.13 | ATM | 2025-05-04 | New Deanna | Mobile | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-05 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99998 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 4030.95 | UPI | 2025-07-04 | New Mary | Mobile | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-07 | 47112875.8 | 0.684564 | 0.004167 | 0 |
| 99999 | CUST09998 | Salary | 62401.16 | 755.0 | Employed | Low | 6957.69 | 29 | 792.06 | UPI | 2025-07-04 | West Garyfurt | Branch | 1 | 0 | 42718.3 | Home | 0.02 | 0 | 4 | 1 | -0.485082 | -2.000292 | 0 | 11.041355 | 10.662406 | 2025-07 | 47112875.8 | 0.684564 | 0.004167 | 0 |
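
The report does not say how the engineered columns were computed. The sketch below shows formulas that appear consistent with the sample rows above: a `log1p` transform for `income_log` and `loan_log`, a plain product for `income_credit_interaction`, and ratios with a `+ 1` in the denominator for `credit_to_income` and `txn_intensity`. These formulas are inferred from the displayed values and are an assumption, not the dataset author's documented method. The function name `derive_features` is hypothetical.

```python
import math


def derive_features(income, credit_score, avg_monthly_balance,
                    num_transactions, loan_amount, transaction_date):
    """Recompute the engineered columns for one row.

    All formulas are assumptions inferred from the sample rows,
    not taken from the dataset's documentation.
    """
    return {
        # log(1 + x) transforms of the two monetary columns
        "income_log": math.log1p(income),
        "loan_log": math.log1p(loan_amount),
        # year-month prefix of an ISO date string (YYYY-MM-DD)
        "month": transaction_date[:7],
        # simple product of income and credit score
        "income_credit_interaction": income * credit_score,
        # ratios with +1 in the denominator (guards against division by zero)
        "credit_to_income": loan_amount / (income + 1),
        "txn_intensity": num_transactions / (avg_monthly_balance + 1),
    }


# Inputs taken from row 0 of the first-rows sample (CUST00001)
row = derive_features(
    income=38985.86,
    credit_score=727.0,
    avg_monthly_balance=13080.97,
    num_transactions=32,
    loan_amount=230617.62,
    transaction_date="2025-01-14",
)
for name, value in row.items():
    print(name, value)
```

If these formulas hold, they would also account for several of the high-correlation alerts in the overview: `income_log`, `income_credit_interaction`, and `credit_to_income` would all be direct functions of `income`.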